19 research outputs found

Combining MEDLINE and publisher data to create parallel corpora for the automatic translation of biomedical text

Author: Neveol A
Prieur-Gaston E
Yepes AJ
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/04/2013

BACKGROUND: Most of the institutional and research information in the biomedical domain is available in the form of English text. Even in countries where English is an official language, such as the United States, language can be a barrier for accessing biomedical information for non-native speakers. Recent progress in machine translation suggests that this technique could help make English texts accessible to speakers of other languages. However, the lack of adequate specialized corpora needed to train statistical models currently limits the quality of automatic translations in the biomedical domain. RESULTS: We show how a large-sized parallel corpus can automatically be obtained for the biomedical domain, using the MEDLINE database. The corpus generated in this work comprises article titles obtained from MEDLINE and abstract text automatically retrieved from journal websites, which substantially extends the corpora used in previous work. After assessing the quality of the corpus for two language pairs (English/French and English/Spanish) we use the Moses package to train a statistical machine translation model that outperforms previous models for automatic translation of biomedical text. CONCLUSIONS: We have built translation data sets in the biomedical domain that can easily be extended to other languages available in MEDLINE. These sets can successfully be applied to train statistical machine translation models. While further progress should be made by incorporating out-of-domain corpora and domain-specific lexicons, we believe that this work improves the automatic translation of biomedical texts.

University of Melbourne Institutional Repository

Understanding PubMed(R) user search behavior through log analysis

Author: A. Neveol
G. C. Murray
Madle
R. Islamaj Dogan
Roy
Z. Lu
Publication venue: 'Oxford University Press (OUP)'

Crossref

French Infobutton: an academic and... business perspective

Author: Dahamna Badisse
Darmoni Stéfan
Derville A.
Kerdelhué Gaétan
Letord Catherine
Massari Philippe
Neveol Aurelie
Pereira Suzanne
Piot Josette
Thirion Benoit
Publication venue: HAL CCSD
Publication date: 01/01/2008

International audience

HAL - Normandie Université

Findings of the WMT 2021 Biomedical Translation Shared Task: Summaries of Animal Experiments as New Test Set

Author: Bawden R.
de Vinaspre O. P.
Di Nunzio G. M.
Grozea C.
Mah N.
Martinez D.
Navarro M. V.
Neveol A.
Neves M.
Oronoz M.
Roller R.
Siu A.
Thomas P.
Unanue I. J.
Vezzani F.
Wiemann D.
Yeganova L.
Yepes A. J.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2021

In the sixth edition of the WMT Biomedical Task, we addressed a total of eight language pairs, namely English/German, English/French, English/Spanish, English/Portuguese, English/Chinese, English/Russian, English/Italian, and English/Basque. Further, our tests were composed of three types of textual test sets. New to this year, we released a test set of summaries of animal experiments, in addition to the test sets of scientific abstracts and terminologies. We received a total of 107 submissions from 15 teams from 6 countries.

Archivio istituzionale della ricerca - Università di Padova

Automatic identification and normalization of dosage forms in drug monographs

Author: A Neveol
EG Poon
H Xu
J Li
Jiao Li
JJ Cimino
L Peters
L Zhou
LJ Jensen
O Uzuner
R Harpaz
R Venkataramanan
S Ananiadou
X Wang
Zhiyong Lu
Publication venue: 'Springer Science and Business Media LLC'

Crossref

A MEDLINE categorization algorithm

Author: A Neveol
Aurelie Névéol
B Thirion
Badisse Dahamna
Benoit Thirion
Jean-Francois Gehanno
Jean-Marie Renard
Lina F Soualmia
M Dekkers
M Douyère
M Jenkins
O Bodenreider
P Devos
SJ Nelson
SM Humphrey
Stefan J Darmoni
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Assisted annotation of medical free text using RapTAT

Author: A. Weaver
Aberdeen
Chen
D. Giuse
G. T. Gobbel
H. Xu
J. Garvin
J. Heavirland
J. Williams
Juckett
M. E. Matheny
Matheny
Murff
Neveol
R. M. Cronin
R. Reeves
Roberts
S. H. Brown
S. Jayaramaraja
T. Speroff
Publication venue: 'BMJ'

Crossref